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Abstract 

Background: In this population-based study, we investigated the degree of concordance between Gleason scores 
obtained from prostate biopsies and those obtained from prostatectomy specimens, as well as the determinants of 
biopsy understaging. 

Methods: We considered for this study all 371 prostate cancer patients recorded at the Geneva Cancer Registry 
diagnosed from 2004 to 2006 who underwent a radical prostatectomy. We used the kappa statistic to evaluate the 
Gleason score concordance from biopsy and prostatectomy specimens. Logistic regression was used to determine 
the parameters that predict the undergrading of the Gleason score in prostate biopsies. 

Results: The kappa statistic between biopsy and prostatectomy Gleason score was 0.42 (p < 0.0001), with 67% 
of patients exactly matched, and 26% (n = 95) patients with Gleason score underestimated by the biopsy. In a 
multi-adjusted model, increasing age, advanced clinical stage, having less than ten biopsy cores, and longer delay 
between the two procedures, were all independently associated with biopsy undergrading. In particular, the 
proportion of exact match increased to 72% when the patients had ten or more needle biopsy cores. The main 
limitation of the study is that both biopsy and prostatectomy specimens were examined by different laboratories. 

Conclusions: The data show that concordance between biopsy and prostatectomy Gleason scores lies within the 
classic clinical standards in this population-based study. The number of biopsy cores appears to strongly impact on 
the concordance between biopsy and radical prostatectomy Gleason score. 
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Background 

The Gleason grading system, based upon architectural 
features of prostate cancer cells, is the most widely used 
histological grading method for prostatic adenocar- 
cinoma. The Gleason score (GS) closely correlates with 
clinical behavior, and provides an important index of 
prognosis. Furthermore, this score is one of the key de- 
terminants in treatment decision making, together with 
stage, age and prostate-specific antigen [1]. Indeed, most 
models who were developed to predict the likelihood of 
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pre-treatment findings with long-term outcomes, incor- 
porate biopsy GS [1]. 

Significant discrepancies exist between the GS deter- 
mined by the prostate biopsy, and the GS based upon 
the pathologic specimen. It has been observed that the 
GS from needle biopsies underestimates the GS of the 
radical prostatectomy specimen in 19 to 57% of all cases 
depending on the series and the periods examined [2-8]. 
Such downgrading has a significant impact on treatment 
decisions and patient outcomes, particularly when the 
choice of treatment is between active surveillance and 
curative intent therapy. 

Several factors can influence the likelihood that the bi- 
opsy GS underestimates the prostatectomy score, inclu- 
ding the PSA level, the level of pathologist expertise, the 
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patients age, the results of the digital rectal examination, 
the prostate gland volume, the percentage of cancer cells 
in the biopsy sample and the number of biopsies 
obtained [5,7,9-11]. With reference to the last issue, it 
is worth mentioning that the European Association of 
Urologists (EAU) in 2008 recommended obtaining at 
least 10 cores during the biopsy [12], while in the most 
recent guidelines reduced this number to eight [13]. 

In this population-based study, we investigated the de- 
gree of concordance between GSs obtained from needle 
biopsy of the prostate and radical prostatectomy, and we 
assessed the parameters that may be related to un- 
dergrading the GS in prostate biopsy. 

Methods 

Using data from the population-based Geneva Cancer 
Registry, we considered for this study all 371 prostate 
cancer patients diagnosed from 2004 to 2006, who 
underwent a radical prostatectomy. 

The Geneva Cancer Registry collects information from 
various sources and is considered accurate, as witnessed 
by its very low percentage (<2%) of cases recorded from 
death certificates only [14]. All hospitals, pathology la- 
boratories, and private practitioners in the canton of 
Geneva are requested to report every cancer case. 
Trained tumor registrars systematically extract data from 
medical and laboratory records and physicians regularly 
receive enquiry forms to complete missing data. 

Recorded data for each cancer case include sociodemo- 
graphic variables; method of detection; prostate-specific 
antigen (PSA) value at diagnosis; tumor characteristics 
(including histology, differentiation based on tumor grade, 
and GS); stage at diagnosis according to both clinical and 
pathological TNM classification; lymph node status; treat- 
ment; survival status; and cause of death. Different la- 
boratories, from both the public and the private sector, 
examine biopsy and prostatectomy specimens in Geneva. 
For this study, the variables of interest were age, socioeco- 
nomic status based on the patients last occupation (high, 
middle, low, unknown), sector of care of the first treat- 
ment (public, private), PSA value at diagnosis (<10 ng/ml, 
10-20, >20) and clinical stage (T1-T2, T3-T4, Tx). For 
each patient, we retrieved information on the number of 
biopsy cores and the volume of the prostate in cubic centi- 
meters (cm 3 ). We also calculated the number of biopsy 
procedures performed by each operator during the study 
period within the study population as an indicator of the 
experience of the biopsy performers. To account for the 
possibility of a change in the severity of the disease occur- 
ring during the delay between the biopsy and the prosta- 
tectomy, we calculated the difference in days between the 
prostatectomy and the biopsy dates. 

Concordance between biopsy and prostatectomy GSs 
was calculated through the kappa statistic [15]. The kappa 



statistic corrects for agreement expected by chance: kappa 
values < 0 indicates poor agreement; 0 to < 0.20 indicate 
slight agreement; 0.21 to < 0.40 indicate fair agreement; 
0.41 to < 0.60 indicate moderate agreement; 0.61 to < 0.80 
indicate substantial agreement; and 0.81 to < 1.0 indicate 
almost perfect agreement [16]. We calculated kappa for 
the individual GS (2 to 10) and for categories of GS (<6, 7, 
8-10). Rates of undergrading of GS categories were 
calculated and with logistic regression we assessed deter- 
minants of GS categories undergrading. We used a case- 
control approach considering as cases those patients 
whose biopsy GS was lower than the prostatectomy GS, 
and as controls everybody else. In the multivariate analysis 
we used the back step procedure, entering all the variables 
statistically significant in univariate analysis at the same 
time. 

All data analysis was conducted at the Geneva Cancer 
registry. The Registry has general registry approval by 
the Swiss Federal Commission of Experts for professional 
secrecy in medical research (Commission d experts pour 
le secret professionnel en matiere de recherche medical). 
This approval permits cancer data collection and its use 
for research purposes. 

Results 

The prostate cancer patients were on average 63 (±6.3) 
years old at diagnosis. For 242 (65%) patients, the PSA 
value at diagnosis was below 10 ng/ml; the clinical stage 
was classified as Tl-T2a for 73% of patients. The median 
prostate volume was 36.6 cc (range 4-166) and the me- 
dian number of biopsy cores was 10 (range 2-22). Thirty 
percent of patients had a biopsy by a physician per- 
forming less than 10 biopsies in the three-year study 
period. The median delay between biopsy and prostatec- 
tomy was 84 days (range 7-412). 

Figure 1 shows the concordance between biopsy and 
prostatectomy individual GSs. An exact match was obser- 
ved for 67% of patients (n = 248) while for 26% (n = 95) 
the GS was underestimated by the biopsy; in 7% of pa- 
tients, it was overestimated. The kappa statistic was 0.42, 
indicating a moderate agreement. When considering the 
categories of GS (<6, 7, 8-10) the patients understaged by 
the biopsy were 87 (23%) while the GS matched for 267 
patients (72%). The kappa for the categories of GS was 
0.48 (p< 0.0001), also indicating a moderate agreement. 
Over 30% of patients classified as Gleason score six or less 
from the biopsy, were Gleason score seven or more 
according to the prostatectomy. 

The concordance significantly differed across catego- 
ries of needle cores, number of biopsies performed by 
the operator and time between the biopsy and the pros- 
tatectomy. The kappa statistic for the patients who had 
nine or less biopsy cores was 0.35 (p < 0.001) and the 
proportion of concordance was 62%. For the patients 
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Figure 1 Concordance between biopsy and prostatectomy 
individual Gleason scores. Empty circles represent concordant 
scores between biopsy and prostatectomy; red filled circles 
represent discordant scores. The size is proportional to the number 
of cases falling in each combination. 



who had 10 or more biopsy cores the kappa statistic was 
0.49 (p < 0.001) and the proportion of concordance in- 
creased to 72%. The kappa statistic between biopsy and 
prostatectomy GS was 0.34 (p < 0.001) for the patients 
who had their biopsy performed by an operator with less 
than 30 procedures during the study period, and 0.44 
(p < 0.001) if the operator performed 30 or more proce- 
dures. The kappa statistic for patients who had their 
prostatectomy within the median value (84 days, 2.7 - 
months) from the biopsy was 0.54 (p < 0.001), the sta- 
tistic for the group who had a longer delay was 0.31 
(p < 0.001). 

In the univariate logistic regression analysis, the var- 
iables associated with biopsy undergrading (cases = 95) 
were age, sector of care of the first treatment, clinical 
stage, number of biopsy cores, prostate volume, number 
of procedures performed by the biopsy operator, and 
the delay between the biopsy and the prostatectomy 
(Table 1). In the multivariate analysis the variables inde- 
pendently associated with undergrading selected for the 
model were age, with a 4% increase by each extra year of 
age (OR: 1.04, 95% CI: 1.00-1.09; p = 0.038), clinical stage 
(OR: T3-T4 vs. T1-T2: 1.81, 95% CI: 1.06-3.10; p = 0.030), 
number of cores (OR <9 cores vs. 10+: 1.93, 95% CI: 
1.16-3.22; p = 0.011), and delay between biopsy and 
prostatectomy, with 1% increase by each extra day of 
delay between the 2 procedures (OR: 1.01, 95% CI: 
1.00-1.01, p = 0.018) (Table 1). 

Discussion 

This study shows that in Geneva, the concordance be- 
tween the GSs of biopsy and those of prostatectomy is 



moderate, and lies within the current standards. Our 
study also confirms that a lower number of biopsy cores 
is an important and independent determinant of biopsy 
undergrading. 

The finding of a 67% exact match in grading between 
biopsy and prostatectomy observed in Geneva is similar 
to the proportion found in other studies. Muntener et al. 
and Rajinikanth et al. both found exact match between 
biopsy and prostatectomy GS categories as in 69% of 
patients [4,8]. In particular the latter found that the 
exact match improved from 58% in 1992-1996 to 75% 
in 2002-2006, which represents a period closer to our 
study period [8]. 

As reported by other authors, we found that a higher 
number of biopsy cores is associated with a decreased 
risk of undergrading [10,17]. In our study, the propor- 
tion of exact matching improved significantly, from 62% 
to 72%, when the patients had ten or more cores com- 
pared with those who had nine or less, reinforcing the 
2008 EAU guidelines recommendations of a minimum 
of ten systematic cores per biopsy [12]. However, the 
most recent EAU guidelines reduced to eight the mini- 
mum number of biopsy cores recommended, suggesting 
not to exceed twelve, with adaptations according to the 
prostate volume [13], with the likely intent to keep a 
high validity and at the same time reduce the patients dis- 
comfort and the number of biopsy complications. Serious 
complications may, in fact, occur after a prostate biopsy, 
infection being one of the most serious ones [18]. 

As opposed to the report from Antunes et al. [10], we 
found that the effect of the number of biopsy cores on 
the risk of undergrading did not depend on the radical 
prostatectomy specimen volume in multivariate analysis. 

In our study, older age and a higher clinical stage of 
the disease at diagnosis were also predictors of biopsy 
undergrading, consistently with previous investigations 
[9,11,17,19,20]. This could be linked to increasing intra- 
tumoral heterogeneity of prostate cancer among men 
with advanced age and advanced clinical stage, which 
could affect clinical estimates of the grade of malig- 
nancy [21-23]. 

We found that the concordance was much higher 
among the group of patients with a shorter delay bet- 
ween the biopsy and the prostatectomy (<= 84 days, 
kappa = 0.54) than in the group with a delay longer than 
84 days (kappa = 0.31). For this reason we adjusted our 
multivariate model by the delay between the two proce- 
dures. Consistently with our results, Kvale et al. found 
that upgrading increased with increasing interval from 
biopsy to radical prostatectomy [17]. Delay between bi- 
opsy and surgery is more likely to have occurred in 
lower grade tumors and may well explain the higher dis- 
cordance with the final specimen. However, a histo- 
logical dedifferentiation of the tumor over time has been 
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Table 1 Determinants of Gleason score undergrading between biopsy and prostatectomy 





Total 


Cases 


Controls 


Univariate OR 1 (95% CI) 


p-value 


Multi-adjusted 

OR 1 ' 2 (95% CI) 


p-value 




(n = 371) 


(n = 95) 


(n = 276) 










Age at diagnosis years (in continuous ) 








1.04 (1.00-1.08) 


0.049 


1.04 (1.00-1.09) 


0.038 


Socioeconomic status 
















High 


128 


31 


97 


1.00 








Middle 


155 


43 


112 


1 .20 (0.70-2.05) 


0.502 






Low 


82 


20 


62 


1.01 (0.53-1.93) 


0.977 






Unknown 


6 


1 


5 


0.63 (0.07-5.56) 


0.674 






Sector of care 
















Private 


235 


48 


187 


1.00 








Public 


136 


47 


89 


2.06 (1.28-3.31) 


0.003 






PSA value 
















<10 


242 


63 


179 


1.00 








10-20 


39 


14 


25 


1 .59 (0.78-3.25) 


0.203 






>20 


12 


4 


8 


1.42 (0.41-4.88) 


0.577 






Unknown 


78 


14 


64 


0.62 (0.33-1.19) 


0.149 






Clinical stage 
















T1-T2 


272 


61 


211 


1.00 




1.00 




T3-T4 


93 


33 


60 


1.90 (1.14-3.17) 


0.014 


1.81 (1.06-3.10) 


0.030 


Tx 


6 


1 


5 


0.69 (0.08-6.03) 


0.739 


0.58 (0.06-5.37) 


0.628 


Number of biopsy cores 
















< 9 


175 


56 


1 19 


2.02 (1.24-3.28) 


0.005 


1.93 (1.16-3.22) 


0.01 1 


10 + 


185 


35 


150 


1.00 




1.00 




Unknown 


11 


4 


7 


2.45 (0.68-8.83) 


0.171 


2.65 (0.70-10.0) 


0.151 


Prostate volume cm3 (in continous) 








0.99 (0.97-1.00) 


0.031 






Number of procedures performed by the 
biopsy operator 
















1-29 


110 


37 


73 


1.75 (1.07-2.87) 


0.026 






30+ 


254 


57 


197 


1.00 








Unknown 


7 


1 


6 


0.58 (0.07-4.88) 


0.613 






Delay between biopsy and prostatectomy 

days (in continous) 








1.01 (1.00-1.01) 


0.004 


1.01 (1.00-1.01) 


0.018 



1) OR: Odds Ratio derived from logistic regression considering as cases patients whose biopsy Gleason score was lower than the prostatectomy Gleason score and 
as controls everybody else, 2) Simultaneously adjusted for age, pathological stage, number of biopsy cores and delay between the two procedures. 



demonstrated by some authors, but its association with 
disease progression is not clear [21-23]. 

This study has some limitations. Most importantly, both 
biopsy and prostatectomy specimens were examined by 
different laboratories, leading to possible inter-observer 
variability common in GS interpretation, particularly for 
less experienced pathologists [24]. In our study, however, 
two main laboratories analyzed approximately 92% of all 
prostate biopsies and approximately 70% of all prostate 
surgical samples of prostate cancer cases, which suggests 
a substantial experience by the examining laboratories, al- 
though we cannot rule out variability between observers. 
Furthermore, we found no significant difference of con- 
cordance between biopsy and prostatectomy GS by the 



average number of biopsy analyzed by each pathologist 
(data not shown). 

The strength of this study is its population-based ap- 
proach and that it covers a recent period of time, i.e. 
2004-2005. The period chosen overlaps with a refine- 
ment of the Gleason grading system promoted in 2005 
by the International Society of Urological Pathology con- 
sensus conference (ISUP) [25]. However, we estimate 
that this revision had a negligible impact on our results. 
In fact, contrary to other studies which show after the 
ISUP revision a tendency towards a higher Gleason prog- 
nostic due mainly to an upgrading of the secondary pat- 
tern, we did not observe any difference in the proportion 
of high vs. low grade cancers by year of diagnosis [26-28] . 
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Similarly, we did not detect a difference in neither the pri- 
mary nor the secondary Gleason pattern, for both the bi- 
opsy and the prostatectomy (p-value Pearson chi-square 
patternl and pattern 2 biopsy: 0.856 and 0.663; patternl 
and pattern2 prostatectomy: 0.983 and 0.679, respec- 
tively). Biopsy GS is an important outcome predictor of 
prostate cancer, and especially so for patients treated con- 
servatively. Of particular clinical relevance is the biopsy 
undergrading of patients from grade seven or more to 
grade six or less, as these patients would move from a 
high/intermediate to a low risk group with a high prob- 
ability of receiving a less aggressive treatment approach. 
In our study, there were 147 patients with T1-T2 prostate 
cancer stage, PSA < 10 ng/ml, and a biopsy Gleason 
grade <7 who would have been ideal candidates for 
active surveillance. After the prostatectomy the Gleason 
was upgraded to a grade > =7 for 36 of these patients. 
Thus, in this population-based sample approximately 10% 
of patients with high-risk disease would have been 
undertreated. These results support the active surveillance 
protocols to rebiopsy patients one year after their diagno- 
sis to reduce the proportions of understaged patients to 
approximately 5% [29]. 

Conclusions 

In conclusion, this study confirms that number of biopsy 
cores impacts the concordance between Gleason of bi- 
opsy and prostatectomy. Several actions can be taken to 
optimize concordance of prostate cancer grading bet- 
ween needle biopsy and the prostatectomy specimen. 
Most importantly, adequate training of urologists in 
performing biopsies has the potential not only to op- 
timize diagnostic accuracy but to minimize the patients 
discomfort and morbidity as well. Extended needle bi- 
opsy protocols, with at least ten cores, improve cancer 
detection rates and the concordance of biopsy results 
with the prostatectomy findings. All pathologists evalu- 
ating prostate tissues should be systematically trained in 
Gleason grading. Furthermore, as recently recommended 
by the National Institute for Health and Clinical Ex- 
cellence guidelines [1], results of all prostate biopsies 
should be reviewed by a urological cancer multidisciplin- 
ary team. Men who have selected active surveillance as 
treatment option should have a repeat biopsy after a re- 
view by this team of the risk characteristics, including 
life expectancy, PSA level, digital rectal examination, and 
prostate volume. Use of modern imaging devices, such 
as three dimensional ultrasound during needle biopsy, 
could further help to improve diagnostic accuracy prior 
to clinical decision making. 

Abbreviations 
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